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I provide an incomplete inventory of the astronomical variability that will be found by next-generation time-domain 
astronomical surveys. These phenomena span the distance range from near-Earth satellites to the farthest Gamma Ray 
Bursts. The surveys that detect these transients will issue alerts to the greater astronomical community; this decision 
process must be extremely robust to avoid a slew of "false" alerts, and to maintain the community's trust in the surveys. 
I review the functionality required of both the surveys and the telescope networks that will be following them up, and the 
role of VOEvents in this process. Finally, I offer some ideas about object and event classification, which will be explored 
more thoroughly by other articles in these proceedings. 
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1 Introduction 

Next generation surveys such as Pan- STARRSQ and LSS10 
promise to open up the time domain of astronomical vari- 
ability to the general community as a service. This will al- 
low the global study of all on-going phenomena in real- 
time, enabling both the small aperture amateur astronomer 
and the large aperture professional. It also places great re- 
sponsibility on the surveys themselves to provide a reliable 
stream of information. 

The scope of the planned data release is unprecedented; 
jq5-6 transient "alerts" are predicted to be generated on 
a nightly basis by the LSST alone. This will place a huge 
burden on follow-up networks in the very near future. Un- 
dertaking follow-up of these alerts could easily consume 
all of the available global telescope resources unless intel- 
ligent decisions are made about which events to focus on. 
The volume of the data streams will preclude this decision 
from being made by a human. 

To enable intelligent, autonomous follow-up systems, 
the VOEvent protocol has been developed as a means of 
automatically conveying information between astronomical 
resources. To take advantage of this stream, Heterogeneous 
Telescope Networks (HTNs) are being implemented to un- 
dertake follow-up of alerts. This purpose of this conference 
is to address the practical implementation of this marriage. 
This paper will address the types of astronomical variability 
the surveys will have to sort through, the types of alerts that 
the surveys will generate, and present various ideas on the 
classification of these alerts. I will emphasize that the sur- 
veys must do more than generate alerts; they must also listen 
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to the follow-up results of the community or risk retaining 
outdated classifications based upon their own limited data. 

2 Sources of Astronomical Variability 

The exciting part about surveys such as LSST is that they in- 
tend to detect, classify, and release information on all vari- 
ability found whether it be photometric or astrometric. This 
provides a technical challenge for the surveys in terms of au- 
tonomy and reliability that has been approached but not yet 
demonstrably met in precursor efforts (Bailey et al. 2007; 
Becker etal. 2004). 

Modern surveys use image subtraction techniques to re- 
move the static portion of their images, leaving only the 
residual flux of objects that have varied in brightness or po- 
sition. In these difference images, the astronomical signal 
is almost exclusively objects elongated due to astrometric 
motion, or positive or negative point sources that have var- 
ied in brightness or position. I provide below an incomplete 
listing of astronomical phenomena expected to be found in 
these images, starting with the foreground of astrometrically 
variable objects and ending with the most distant of cosmo- 
logical explosions. 

2.1 Astrometric Variability 

Earth-orbiting satellites provide the least interesting (for 
most scientists) but most destructive foreground. They move 
at an angular velocity of order 10 4 " s _1 , and completely 
traverse a single field-of-view in a typical exposure time. 
Their image signature is a nearly infinitely elliptical streak. 
Inactive satellites may also be tumbling, which yields a pe- 
riodic sequence of flashes along the satellite's vector. Most 
photometry software is not designed to accurately model 
such a trail, and thus will deblend the trail into numerous 
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elongated sources. This sequence of detections may be iden- 
tified in a database using e.g. a Hough transform (Storkey 
etal. 2004). 

Solar system objects move with an apparent angular ve- 
locity that is a combination of their own spatial velocity (the 
dominant term for nearby objects) and the Earth's reflex 
motion (dominant for distant objects). These objects may 
appear elongated depending on their apparent angular mo- 
tion, angle from opposition, and exposure time. Near-Earth 
objects (NEOs) have closest approaches of order 0.001 AU, 
where their apparent sky motion is as fast as 1" s _1 . Main 
belt asteroids are found between 1 and 2 AU from Earth, or- 
biting between Mars and Jupiter. Typical angular velocities 
are 10" hr _1 . Trans-Neptunian objects (TNOs), or Kuiper 
belt objects (KBOs), orbit within or beyond the orbit of 
Neptune near 40 AU. Moving at ~ 1" hr _1 , they appear as 
single-epoch point sources in all but the longest astronom- 
ical exposures, and are difficult to distinguish from back- 
ground transients. An ensemble of moving objects imaged 
nightly will yield a "new" detection per object per image. 
These single-epoch detections may be efficiently linked in- 
to orbits (Kubica et al. 2007), with any orphaned detections 
potentially background variability. Main Belt asteroids and 
TNOs are concentrated strongly in the ecliptic plane, which 
may be used as a prior to disentangle them from background 
cosmological transients. Beyond the Kuiper belt, there are 
very few Solar System objects known. The Oort cloud of 
comets is thought to exist between 10 4 and 10 5 AU, how- 
ever no objects are currently known at this distance. This 
transition region reflects the boundary from objects that pri- 
marily reflect the Sun's radiation to objects that produce 
their own. 

Beyond this regime, astrometric motion is only notice- 
able in the nearest or most rapidly moving of our stellar 
neighbors. The extreme example of this is Barnard's Star, 
whose apparent angular motion is <~ 10" yr _1 . The differ- 
ence imaging signature of high proper motion objects is a 
dipole whose nodes grow more separate as a function of 
time, over the timescale of years. Since dipoles are also a 
classical signature of image subtraction failure, these may 
be difficult to distinguish from systematic noise until an en- 
semble of difference images is examined. 

2.2 Photometric Variability 

Beyond the solar neighborhood, all variability will have the 
higher order moments of the image's PSF (a noteworthy ex- 
ception are supernova light echoes, which leave large, low 
surface brightness features evolving radially from a cen- 
tral point; Rest etal. 2005). Since these variables all appear 
as point sources, contextual and temporal information are 
required to classify the nature of each event. I summarize 
some of the expected source populations below. 

Planetary transit searches are undertaken on nearby, ap- 
parently bright, stars out to a distances of a couple of kilo- 
parsecs (Pollacco et al. 2006). Their lightcurves are charac- 
terized by minute (several percent), periodic (several days) 



decrements in the host star brightness as it is transited by its 
exoplanet. 

At similar distances, a foreground "fog" of flaring M- 
dwarf stars has been found (Kulkarni & Rau 2006). This 
comprises low-mass stars that are too faint to be seen in a 
single image but which may flare suddenly, yielding short 
timescale (~ 1000 s) apparently hostless transients (Becker 
etal. 2004). The all-sky rate of these may be up to tens of 
millions per year, making them a significant foreground to 
cosmological variability. 

At a distance of 8 kpc, the Galactic center is used to 
backlight a foreground microlensing population, yielding 
(typically) symmetric, unique source brightenings (Paczyn- 
ski 1991). Durations of these events can be anywhere from 
10° to 10 3 days. Because of their angular coverage and tem- 
poral sampling, microlensing surveys have also yielded a 
wealth of information on stellar variability of all types. At 
around 50 kpc, the Large and Small Magellanic Clouds are 
targeted for microlensing by objects in our Galactic halo 
(Paczynski 1986), with typical timescales of 10 2 days. In- 
dividual stars in these galaxies are able to be resolved from 
the ground and photometered, although the blending can be 
quite severe. 

RR Lyrae are periodically pulsating horizontal branch 
stars that may be recognized from their lightcurve shapes. 
Their periods can range from 0.2 to 2 days. Because of their 
well-studied period-luminosity relationship, their apparent 
magnitude distribution can be used to infer Galactic struc- 
ture (Sesar et al. 2007). They have been found in our Galac- 
tic halo out to 100 kpc; LSST expects to probe Galactic ac- 
cretion structure using RR Lyrae out to 400 kpc. 

At a distance of 1000 kpc, M31 is also targeted by mi- 
crolensing surveys, but the lensed stars are unable to be 
resolved and difference imaging techniques are necessary 
(Uglesich etal. 2004). 

The HST Key Project measured lightcurves of Cepheid 
variable stars in nearby galaxies out to 10 Mpc to mea- 
sure the Hubble constant (Freedman etal. 2001). Cepheids 
are intrinsically several magnitudes brighter than RR Lyrae, 
meaning they can be seen to larger distances. They also have 
well calibrated period-luminosity relationships, with peri- 
ods of 5 to 30 days. 

Starting at around 100 Mpc (a redshift of z <~ 0.1), a 
wealth of nearby supernovae have been observed, starting 
with the Calan-Tololo sample of Type la supernova (Hamuy 
etal. 1996). Because supernovae are typically enmeshed in 
the light of their host galaxies, image subtraction techniques 
are required to photometer the supernova light uniquely. 
Contextually, supernovae are commonly (but not always) 
found near an extended host galaxy. Their lightcurves expe- 
rience a steep rise (~ 20 days) and a gradual fall over ~ 100 
days characteristic of their subtype (la, Ib/c, Hp, etc.). 

Just beyond this sample, around a redshift of 0.1, the 
closest Gamma Ray Bursts have been found (GRB 031203; 
Gotz et al. 2003). Their temporal evolution is much faster 
than supernovae, rising and falling within hours to days, 
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making optical discovery of these phenomena extremely di- 
fficult. 

Medium redshift supernova surveys like the SDSS-II 
Supernova Survey find events out to z ~ 0.4 (Frieman et al. 
2007). Higher redshift supernova surveys, including both 
ESSENCE (Wood-Vasey et al. 2007) and the Supernova Le- 
gacy Survey (SNLS; Astier et al. 2006), detect la events be- 
tween 0.3 < z < 1.0, while the highest redshift supernova 
surveys involving the HST find la supernovae out to z ~ 1.4 
(Riess et al. 2007). At their most distant, supernova are only 
visible for several days around their peak. Finally, the high- 
est redshift cosmological transient, GRB 050904, was found 
at z ~ 6.3 (Haislip etal. 2005). 

The overall extent of astronomical variability is clearly 
enormous, and individual surveys have typically been com- 
missioned to address a subset of the above whole. However, 
surveys such as LSST anticipate not only detecting and clas- 
sifying the above phenomena, but doing so in real-time. 

3 Alerts and Classification of Sources 

The general paradigm for alert generation is that a new event 
will be recognized by a real-time survey pipeline, and the 
survey will subsequently release an "alert" describing the 
detection (and possibly supporting characterization) obser- 
vations. The format of these alerts will be as VOEvent pack- 
ets. The decision to take action based upon a VOEvent is 
undertaken by "intelligent agents" as a part of each HTN. 
Depending upon the science goals of each HTN, different 
agents will make different decisions based upon the same 
information. These actions will depend upon the event clas- 
sifications of the survey and potentially of the agent itself. 
I summarize below some of the requirements necessary for 
this model to succeed. 

3.1 Alerts and Followup 

A VOEvent packet includes inference fields, where the 
survey lists to the best of its abilities the classification of 
the event, as well as the probability that the event is of 
this class. These alerts may be urgent in nature, suggesting 
immediate follow-up. The HTN resources decide to target 
or not based on the inference and probability, and 
their particular science goals. The surveys and follow-up 
networks will also be releasing more prosaic informational 
alerts that do not necessarily require action. These should 
be released each time an object that has been alerted on has 
been followed up. 

Each HTN's intelligent agent may be assumed to be 
autonomous from the others. Consequently, these different 
agents may come to different conclusions about the true na- 
ture of the event given the same information. This could 
easily lead to asynchronous/conflicting evolution of knowl- 
edge between networks. 

The instantaneous state of knowledge about an alert can 
be extracted from its ensemble of VOEvents by a citation 



mechanism that links multiple observations together - they 
are federated by mutual citation. As an alternative, the con- 
cept of a "broker" has been introduced representing an agent 
who centralizes and disseminates this information. As with 
the surveys, the broker's survival requires engendering trust 
from its subscribers. Surveys may play a hybrid role in this 
model, releasing VOEvents on the entirety of their alerts 
but also serving as brokers by releasing more descriptive 
alerts (including e.g. fit parameters) on particular subsets of 
events. 

3.2 Source Classification 

To release accurate alerts with a minimum of false posi- 
tives, i.e. to have trustworthy inference in VOEvents, 
model templates of event behavior must be built beforehand. 
To first order, there are two levels of classification requir- 
ing image-based (spatial) and lightcurve-based (temporal) 
models. 

3.2.1 Spatial Classification 

The morphological classification of flux in an image is a 
well-studied problem. All astronomical images have a char- 
acteristic point spread function (PSF), which is the trans- 
fer function of a point source through the atmosphere, tele- 
scope optics, and detecting instrument. Objects sharper than 
the PSF are not likely to be real astronomical phenomena; 
objects broader than the PSF may be noise artifacts or re- 
solved objects such as galaxies or comets. Moving objects 
will have the profile of the PSF in the dimension perpendic- 
ular to motion, and the PSF convolved with a line along the 
direction of motion 

Classification in difference images occurs through com- 
parison of the residual flux with the PSF profile. A spatial 
model of the PSF is typically built from the data itself. This 
step requires direct access to the images where the variabil- 
ity was detected. Given the potential bandwidth and disk 
access requirements of the alternative, this step is almost 
exclusively the responsibility of the survey. 

3.2.2 Temporal Classification 

Sets of detections may be linked into a lightcurve through 
mutual citation or by a broker. To recognize a given event 
as a certain class of phenomena, data models must first be 
built that span the parameter space characterizing the phe- 
nomena. The lightcurve data are then compared to each tem- 
plate lightcurve in a probabilistic sense, determining which 
model (if any) they fit best. These ideas are explored more 
fully in other articles in these proceedings (Bloom, Maha- 
bal, Bailey). 

A complication for these next generation surveys is that 
a data model incomplete at the e.g. 1% (or 0.1%, or even 
0.01%) level will result in an unacceptably large number 
of falsely classified alerts. The models need to be nearly 
bullet-proof. To build these models, both theoretical and 
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experimental priors should be taken into account. However, 
the observational data best span the range of actual (as op- 
posed to expected) phenomenology. This suggests that an 
ideal use of existing datasets is to build these models be- 
fore they are required by LSST and Pan-STARRS. A prime 
example of phenomenological model building is in the de- 
scription of supernova la lightcurves (e.g. Guy et al. 2007). 

One successful implementation of lightcurve classifica- 
tion driving real-time decision making is by the SDSS-II 
Supernova Search (Sako etal. 2007). The evolving light- 
curves are fit to various supernova lightcurve models af- 
ter each epoch of observation. SDSS-II find that after 2-3 
lightcurve epochs, approximately 90% of objects photomet- 
rically classified as la end up being la. While their analysis 
does not examine the efficiency of this process (how many 
la are missed), it is nevertheless an encouraging precursor 
effort. 

3.3 Data Access and Computational Requirements 

In the process of alerting and event classification, there are 
multiple stages of computation and inference undertaken by 
members of this community. The separation of responsibil- 
ities emerges by examining which aspects of the data are 
required to address each problem, and who has easiest ac- 
cess to it. 

As concrete examples, plausible alerts on moving ob- 
jects include : a known object may soon be lost, or its er- 
ror ellipse is growing at an unacceptable rate; an object is 
brighter/fainter than past behavior would indicate; or the 
likelihood of this object impacting Earth is significant. Each 
of these alerts requires computation and knowledge exchange. 
For the first, the future cadence of the survey is required; 
this sort of computation is best done by the survey. For the 
second, the past behavior of the object must be accessible or 
queryable. This is a requirement on the survey or on the fed- 
eration of all data on the object. The last use case requires 
significant computational resources to project all moving 
objects into the future. This computation is most likely to 
get done (with high latency) by a motivated user. 

In the use case of a newly detected transient, it is unclear 
who (if anyone) will have the final say in classification of 
the event. Especially in the early portion of the lightcurve, 
the inference will be changing rapidly as new data come in. 
A broker seems most useful at this critical stage, tying all 
data together into a coherent inference. 

Finally, there are the cases of known objects whose cur- 
rent behavior is unexpected. Examples include exotic devi- 
ations in an on-going microlensing event due to a planet 
orbiting the lens, stellar flares from low-mass stars, and de- 
viations in the timings of transiting systems due to the grav- 
itational influence of other unseen planets. In these cases, 
real-time lightcurve fitting is required (an extensive task), 
not likely to happen by the survey unless it also chooses to 
serve as a broker. 



4 Conclusions 

I have detailed some of the interplay between surveys and 
follow-up networks in time-domain astronomy, highlight- 
ing their potential roles and responsibilities. While the con- 
cept of data federation using VOEvent's citation mechanism 
may work, it may also lead to different inferences by dif- 
ferent resources. An alternative is to broker the evolving 
"truth" regarding an event by trusted agents in the system. 
These agents have the potential to fully realize (or scut- 
tle) the successful interplay between surveys and follow-up 
HTNs. 

The surveys and agents each play a role in the ultimate 
classification of each event. I emphasize that the surveys 
should also listen to the network, and decide if and how 
to allow these external resources to influence their internal 
event classifications. Finally, the need for accurate and pre- 
cise data models is commensurate with the data flow that 
will be compared against them. These models should be 
built sooner rather than later using currently existing data 
sets, to ensure that the promise of the first several years of 
these surveys is not lost to faulty or inaccurate alerts. 
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